Transformers analyzing poetry: multilingual metrical pattern prediction with transfomer-based language models

نویسندگان

چکیده

Abstract The splitting of words into stressed and unstressed syllables is the foundation for scansion poetry, a process that aims at determining metrical pattern line verse within poem. Intricate language rules their exceptions, as well poetic licenses exerted by authors, make calculating these patterns nontrivial task. Some rhetorical devices shrink length, while others might extend it. This opens door interpretation further complicates creation automated algorithms useful automatically analyzing corpora on distant reading fashion. In this paper, we compare identification systems available Spanish, English, German, against fine-tuned monolingual multilingual models trained same Despite being initially conceived suitable semantic tasks, our results suggest transformers-based retain enough structural information to perform reasonably Spanish setting, outperforms both English German when using model three languages, showing evidence benefits cross-lingual transfer between languages.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multiobjective Optimization for Meaningful Metrical Poetry

This paper reports our experiments to properly handle the multiobjective optimization nature of poetry generation — as defined in Manurung (2003) — as a stochastic search that seeks to produce a text that simultaneously satisfies the properties of grammaticality, meaningfulness, and poeticness. In particular, we employ the SPEA2 Algorithm (Zitzler, Laumanns, and Thiele 2001). Various results sh...

متن کامل

A chart generation system for topical metrical poetry

Several poetry generation systems that are in some way inspired or motivated by existing articles such as newspaper stories have recently appeared. However, most if not all of them employ template-based generation, which limits both the expressiveness of the system and the ability to faithfully convey the message of the source article. In this paper we present our work on a poetry generation sy...

متن کامل

Rhythm Analysis and Linear Modeling of Metrical Poetry Respiratory Signal

The paper studies the breathing patterns for four types of Chinese metrical poems, namely, fivecharacter quatrain, five-character octave, sevencharacter quatrain and seven-character octave. A breathing belt was tied to the chest and the respiratory signal was acquired by EMG. Respiratory parameters were defined regarding the dynamical properties of the signal, and were automatically extracted b...

متن کامل

Machine Learning for Metrical Analysis of English Poetry

In this work we tackle the challenge of identifying rhythmic patterns in poetry written in English. Although poetry is a literary form that makes use standard meters usually repeated among different authors, we will see in this paper how performing such analyses is a difficult task in machine learning due to the unexpected deviations from such standard patterns. After breaking down some example...

متن کامل

Efficient Handling of Multilingual Language Models

In this paper we introduce techniques for building a multilingual speech recognizer. More specifically, we present a new language model method that allows for the combination of several monolingual into one multilingual language model. Furthermore, we extend our techniques to the concept of grammars. All linguistic knowledge sources share one common interface to the search engine. As a conseque...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Neural Computing and Applications

سال: 2021

ISSN: ['0941-0643', '1433-3058']

DOI: https://doi.org/10.1007/s00521-021-06692-2